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ABSTRACT 

The He rsc/ie/-ATL AS is a survey of 550 square degrees with the Herschel Space Observatory 
( ^-» ) in five far-infrared and submillimetre bands. The first data for the survey, observations of a 

field 4x4 deg 2 in size, were taken during the Science Demonstration Phase, and reach a 5a 
noise level of 33.5 mJy/beam at 250 /im. This paper describes the source extraction meth- 
ods used to create the corresponding Science Demonstration Phase catalogue, which contains 
6876 sources, selected at 250 /im , within ~14 sq. degrees. SPIRE sources are extracted using 
a new method specifically developed for Herschel data; PACS counterparts of these sources 
are identified using circular apertures placed at the SPIRE positions. Aperture flux densities 
are measured for sources identified as extended after matching to optical wavelengths. The 
reliability of this catalogue is also discussed, using full simulated maps at the three SPIRE 
bands. These show that a significant number of sources at 350 and 500 /imhave undergone 
flux density enhancements of up to a factor of ^2, due mainly to source confusion. Correction 
factors are determined for these effects. The SDP dataset and corresponding catalogue will be 
available from http : / /www . h-atlas . org/. 
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1 INTRODUCTION 

The Herschel Astrophysical Terahertz Large Area Survey (H- 
ATLAS) survey is the largest, in time and area, of the extragalactic 
Open Time Key Projects to be carried out with the European Space 



E-mail: emma.rigby@nottingham.ac.uk; emmaerigby@gmail.com 



Agency (ESA) Herschel Space Observatory jPilbratt et aL]2 010P 
When complete it will cover ~550 square degrees of the sky, in 
five far-infrared and submillimetre bands (100, 160, 250, 350 and 

1 Herschel is an ESA space observatory with science instruments provided 
by European-led Principal Investigator consortia and with important partic- 
ipation from NASA. 
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500 /im ), to a 5<r depth of 33 mjy/beam at 250 . The predicted 
number of sources is ~ 200,000; of these ~40,000 are expected to 
lie within z < 0.3. A full description of the survey can be found in 
pales et al.lffijlO) . 

This paper presents the 250 /im selected source catalogue 
created from the initial H-ATLAS Science Demonstration Phase 
(SDP) observations. Eight papers based on this catalogue have al- 
ready been published in the A&A Herschel Special Issue ranging 
from the identification of blazars ( Gonzalez-Nuevo et al. 2010) and 
debris disks ( Thomps on et al.|2010l in th e SDP field, to determina- 
tions of the colours jAmblard et al.|2010) , source counts (Clements] 
|et al.|20 10 1, clustering (Maddox et al.|2010) and 250 /J,m luminosity 
function evolution ( |Dye et al.|2010 1 of the submillimetre popula- 




tion, as well as the star formation history of quasar host galaxies 
l |Serjeant et al.|2010) an d the dust energy balance of a nearby spiral 
galaxy (Baes et al.|2010) . 

The layout of the paper is as follows: Section [2] describes the (a) Original combined map 

SDP observations; Section[3]describes the source extraction proce- 
dure for the five bands; finally, Section [4] outlines the simulations 
used to quantify the reliability of the catalogue. For more details of 
the SDP data see |Pascale eTai~|p010l > and |Ibar et al~l(2010a| > for 
the SPIRE and PACS data reduction respectively, and Smith et al. 
(201 1 1 for the multiwavelength catalogue matching. 



2 HERSCHEL OBSERVATIONS 

The SDP observations for the H-ATLAS survey cover an area of 
~4°x4°, centred at a=09 h 05 m 30.0 a , S =00°30' 00.0" (J2000). 
This field lies within one of the regions of the GAMA (Galaxy and 
Mass Assembly) survey {Driver et al. ||20"09] > so optical spectra, 
along with additional multiwavelength data, are available for the 
majority of the low-redshift sources. 

The observations were taken in parallel-mode, which uses the 
Photodetector Array Camera and Spectrometer (PACS; Poglitsch et 
al. |2010|l and Spectral and Photometric Imaging REciever (SPIRE; 
Griffin et al. 2010) instruments simultaneously; two orthogonal 
scans were used to mitigate the effects of 1// noise. The time-line 
data were reduced using HIPE jOtt et al. |2010) . SPIRE 250, 350, 
and 500 ^im maps were produced using a naive mapping technique, 
after removing any instrumental temperature variations (Pascale et 
al. |2010) , and incorporating the appropriate flux calibration factors. 
Noise maps were generated by using the two cross-scan measure- 
ments to estimate the noise per detector pass, and then for each 
pixel the noise is scaled by the square root of the number of detec- 
tor passes. The SPIRE point spread function (PSF) for each band 
was determined from Gaussian fits to observations of Neptune, the 
primary calibrator for the instrument. Maps from the PACS 100 and 
160 /im data were produced using the PhotPro ject task within 
HIPE (Ibar et al. 2010a). A false colour combined image of a part 
of the three SPIRE maps is shown in Figure^ The measured beam 
full-width-half-maxima (FWHMs) are approximately 9", 13", 
18" , 25" and 35" for the 100, 160, 250, 350 and 500 /jm bands re- 
spectively (Ibar et af~[ 2010a; Pas cale et al. | 2010l. The map pixels 
are 2.5" , 5" , 5" , 10"and 10" in size for the same five bands. 

The noise levels measured by Pa scale et al. |p010| > for the 
250 /im and 500 SPIRE bands are in good agreement with those 
predicted using the Herschel Space Observatory Planning Tool 
(HSporb; for the 350 /im band they are considerably better. The 



(b) After background subtraction 

Figure 1. False-colour images of a 1.5 sq. degree region of the SDP field 
showing the three SPIRE bands combined. Image (a) is before background- 
subtraction and shows clear contamination by galactic cirrus; image (b) 
shows the reduction in contamination after subtracting the background. 

corresponding PACS noise levels determined by Ibar et al. (2010a) 
are currently higher than predicted (26 mjy and 24 mjy, compared 
with 13.4 mjy and 18.9 mJy for 100 /imand 160 [im respectively), 
but this may improve in future with better map-making tech- 
niques. The flux calibration uncertainties are 15% for the three 
SPIRE bands (Pascale et al. |2010) and 10 and 20% for the PACS 
100 /im and 160 /im bands respectively ( Ibar et al. |2010a| >. 



3 SOURCE EXTRACTION 

The ultimate aim for the source identification of the H-ATLAS 
data is to use a multiband method to perform extraction across the 
five wavebands simultaneously, thus utilising all the available data 
as well as easily obtaining complete flux density information for 
each detected galaxy, without having to match catalogues between 
bands. However, the short timescale for the reduction of these SDP 
observations, combined with the higher than expected PACS noise 



2 HIPE and HSpot are joint developments by the Herschel Science Ground 



Segment Consortium, consisting of ES A, the NASA Herschel Science Cen- 
ter, and the HIFI, PACS and SPIRE consortia 
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(b) Isolated source 

Figure 2. The input (true) and extracted position for two point sources in the 250 fim simulated maps before the addition of Gaussian noise (noiseless), after 
the noise has been added (noisy), and after further convolving with the 250 fim point spread function (PSF) to create the final realistic sky (PSF-filtered) (see 
Section|4]for full details), to illustrate how the position, and therefore flux density, of an extracted source found by MADX can be influenced by the presence 
of a close companion. 
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Figure 3. A comparison between the MADX and aperture measured fluxes for the sources with a possible optical identification in the matched catalogue of 
Smith et al. (201 1). Source identified as extended are highlighted in bold. 



levels, means that this was only possible for the three SPIRE bands. 
As a result, the source extraction for the PACS and SPIRE maps is 
discussed separately in this Section. 

The full H-ATLAS SDP catalogue described here will be 
available at http : / / www .h-atlas.org/. 



3.1 The SPIRE catalogue 

Sources are identified in the SPIRE 250, 350 and 500 fim maps us- 
ing the Multi-band Algorithm for source extraction (MADX, Mad- 
|dox et al. |201 1) , which is being developed for the H-ATLAS sur- 
vey. Several methods for generating the final SPIRE catalogue with 
MADX were investigated and these are described below. 

The first step in the MADX source extraction is to subtract 



a local background, estimated from the peak of the histogram of 
pixel values in 30 x 30 pixel blocks (chosen to allow the map 
to be easily divided up into independent sub-regions). This cor- 
responds to 2.5' x 2.5' for the 250 fim map, and 5' x 5' for the 350 
and 500 fim maps. The background (in mjy/beam) at each pixel was 
then estimated using a bi-cubic interpolation between the coarse 
grid of backgrounds, and subtracted from the data. Figure [T] il- 
lustrates the reduction in background contamination (mainly aris- 
ing from galactic cirrus, which dominates over the confusion noise 
from unresolved sources) obtained using this method. 

The background subtracted maps were then filtered by the 
estimated PSF, including an inverse variance weighting, where 
the noise for each map pixel was estimated from the noise map 
(matched filtering, e.g. Turin 1960, Serjeant et al. 2003 ). The back- 
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ground removal has a negligible effect on the PSF because the his- 
togram peak is insensitive to resolved sources in the background 
aperture; this will be discussed further in Mad dox et al. | ( [2011| >. 
We also create a 'filtered noise' map which represents the noise on 
a pixel in the PSF filtered map. This is lower than the raw noise 
map because the noise in the SPIRE pixels is uncorrected, and so 
filtering by the PSF reduces the noise by approximately the square 
root of the number of pixels per beam. 

The maps from the 350 and 500 /im bands are interpolated 
onto the 250 /im pixels. Then all three maps are combined with 
weights set by the local inverse variance, and the prior expec- 
tation of the spectral energy distribution (SED) of the galaxies. 
We used two SED priors: a flat-spectrum prior (assumed to be 
flat in /„), where equal weight is given to each band; and also 
250 fim weighting, where only the 250 /imband was included. 

Local, > 2.5cr, peaks are identified in the combined PSF fil- 
tered map as potential sources, and sorted in order of decreasing 
significance level. A Gaussian is fitted to each peak in turn to pro- 
vide an estimate of the position at the sub-pixel level; this can be 
influenced by the presence of a neighbouring source, as illustrated 
in Figure|2] but the effect is minimal. The flux in each band is then 
estimated using a bi-cubic interpolation to the position given by 
the combined map. The scaled PSF is then subtracted from the map 
before going on to the next source in the sequence. This ensures 
that flux from the wings of bright sources does not contaminate 
nearby fainter sources. This sorting and PSF subtraction reduces 
the effect of confusion, but in future releases we plan to implement 
multi-source fitting to blended sources. 

To produce a catalogue of reliable sources, a source is only 
included if it is detected at a significance of at least 5cr in one of the 
SPIRE bands. The total number of sources in the SPIRE catalogue 
is 6876. 

For our current data we chose to use the 250 ^im only prior 
for all our catalogues, which means that sources are identified at 
250 /im only. At the depth of the filtered maps source confusion is 
a significant problem, and the higher resolution of the 250 maps 
outweighed the signal-to-noise gain from including the other 
bands (see Section |4~Tj and Figure [8c}. This may introduce a bias in 
the catalogue against red, potentially high-redshift, sources that are 
bright at 500 fim , but weak in the other bands. However, compar- 
ing catalogues made with both the 250 fim and flat-spectrum pri- 
ors showed that the number of missed sources is low: 2974 > 5a 
350 /im sources and 348 > 5a 500 /im sources are detected with 
the flat prior, compared with 2758 and 307 sources detected us- 
ing the 250 ^m prior (i.e. 7% and 12% of sources are missed at 
350 /im and 500 /im respectively). It should also be noted that for 
a high-redshift source to be missed it would need a 500 /im to 
250/imfiux ratio of > 2.7 (i.e. it has to be < 2.5a at 250/imto 
be excluded from the catalogue). Assuming typical SED templates 
(e.g. M82 and Arp220), this means that this should only occur for 
sources which lie at redshifts > 4.6. We aim to revisit this issue in 
future data-releases. 

Since MADX uses a bicubic interpolation to estimate the peak 
flux in the PSF filtered map, it partially avoids the peak suppression 
caused by pixelating the time-line data, as discussed by Pascale et 
al. Nevertheless the peak fluxes are systematically underestimated, 
and so pixelization correction factors were calculated by pixelating 
the PSF at a large number of random sub-pixel positions. The mean 
correction factors were found to be 1.05, 1.11 and 1.04 in the 250, 
350 and 500 /im bands respectively, and they have been included in 
the released SDP catalogue. 

In calculating the a for each source, we use the filtered noise 
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Figure 4. The differential source counts from the PACS section of the SDP 
catalogue compared to the initial results from the three fields covered by the 
PEP survey ^Berta et al.|20T0) . 
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map and add the confusion noise to this in quadrature. The average 
la instrumental noise values are 4.1, 4.0 and 5.7 mjy/beam respec- 
tively, with 5% uncertainty, in the 250, 350 and 500 /im bands, de- 
termined from the filtered maps ( Pasc ale et al. |2010") . We estimated 
the confusion noise from the difference between the variance of the 
maps and the expected variance due to instrumental noise (assum- 
ing that confusion is dominating the excess noise), and find that the 
lcr confusion noise is 5.3, 6.4 and 6.7 mjy/beam at 250, 350 and 
500 /im , with an uncertainty of 7%; these values are in good agree- 
ment with those found by Nguyen et al. ( 2010 1 using data from the 
Herschel Multi-tiered Extragalactic Survey (HerMES). The result- 
ing average 5a limits are therefore 33.5, 37.7 and 44.0 mjy/beam. 
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3.1.1 Extended sources 

The flux density extracted by MADX will underestimate the 
true value for sources that are larger than the SPIRE beams, 
which have FWHM of 18.1", 24.8" and 35.2" for 250, 350 and 
500 /im respectively. This occurs because the peak value taken by 
MADX only accurately represents the true flux density of a source 
if it is point-like. These extended sources can be identified if they 
also have a reliable optical match and therefore a corresponding 
optical size, r opt (equivalent to the 25 mag arcsec~ 2 isophote), in 
the SDSS or GAMA catalogues (see |Smith et"aT1p0TT) for full de- 
tails of the matching procedure and the determination of the match 
reliability, Rj). The size of the aperture used is listed in the cata- 
logue, and the most appropriate flux density, either point source or 
aperture measurement (when this is larger), is given for each source 
in the SPIRE 'BEST flux' columns. It should be noted that, apart 
from two exceptions, this is necessary at 250 and 350 /im only, as 
the large 500 /imbeam size means that the flux discrepancy is neg- 
ligible for that map. 

An 'extended source', in a particular map, is defined here as 
one with r opt > 0.5 x FWHM, and to ensure only true matches 
are used, it must also have a match-reliability, Rj, greater than 
0.8. In total, the MADX 'BEST' flux columns for 167 sources at 
250 /im and 53 sources at 350 /mi were updated with aperture pho- 
tometry values. 

The aperture radius, o r , in a particular band is set by summing 
the optical size in quadrature with the FWHM of that band: 

a r = y^FWHM 2 + r1 pt . (1) 

The exceptions to this were the apertures used for sources H- 
ATLAS J091448.7-003533 (a merger, where the given a r is 
insufficient to include the second component) and H-ATLAS 
J090402.9+005436, which visual inspection showed was clearly 
extended. In these cases the aperture sizes used are chosen to match 
the extent of the sub-mm emission, and fluxes are replaced in the 
500 /im band as well. 

The apertures are placed on the MADX, Jy/beam, background 
subtracted maps, at the catalogue position for each source; the mea- 
sured values are converted to the correct flux scale by dividing by 
the area of the beam derived by Pasca le et al. | ( [20~1 ) for each map 
(13.9, 6.6 or 14.2 pixels for 250, 350 and 500 /im respectively). The 
corresponding la error is given by -^/Hap, where n ap is the sum 
of the variances within apertures placed in the same positions on 
the relevant variance maps. Confusion noise estimates were again 
added in quadrature to these uncertainties; these were scaled ac- 
cording to the area of each individual aperture. 

Figure [3] compares the MADX and aperture measured fluxes 
for all catalogue sources with a possible optical identification. It 
shows that the majority of objects are point-like, for which the 
agreement between the two sets of fluxes is good. The sources iden- 
tified as extended are highlighted in bold, and it is clear that MADX 
underestimates these at 250 and 350/imif they are brighter than 
-100 mJy. 

3.2 The PACS catalogue 

The higher noise levels in the PACS maps, along with the shape 
of the source SEDs, mean that all the PACS extragalactic sources 
should be clearly detected in the SPIRE catalogue. Sources in the 
PACS data are therefore identified by placing circular apertures at 
the SPIRE 250 /impositions in the 100 /mi and 160 /immaps, after 
correcting the PACS astrometry to match that of the 250 /mi map 



(using the sources present in both the SPIRE and PACS maps). 
There are two steps to this source detection process: first a 'point 
source' measurement is obtained for all SPIRE positions using 
apertures with radii of 10" (100 /im ) or 15" (160 /im ); next addi- 
tional aperture fluxes are found for positions where a PACS source 
would satisfy the extended source criteria discussed in Section 
|3.1.1| Aperture radii in this case are calculated using Equation[T| as- 
suming FWHM of 8.7" and 13.1" for 100 and 160 /xm respectively. 
These FWHM values are calculated using rough modelling of the 
Vesta asteroid as the full PACS PSFs are asymmetric (see Ibar et al. 
2010a, for a full discussion). 

The aggressive filtering used for these maps means that the 
large scale structure in the cirrus has already been removed, 
but some noise stripes remain. These are removed globally at 
160 /mi by subtracting a background determined within 10x10 
pixel blocks. However, at 100 /im this global approach was found to 
introduce negative holes around bright sources so the background 
value is determined for each source individually using a local an- 
nulus with a width of 0.5 times the aperture radius. 

Unlike SPIRE, the PACS maps have units of Jy/pixel so no 
beam conversion is needed. However, the fluxes are divided by 
1.09 (100 ^m) or 1.29 (160 /im) as recommended by the PACS 
Instrument Control Centr^] These scaling factors are now incor- 
porated into the data-reduction pipeline and have been applied to 
the public release of the PACS SDP maps, along with the astrome- 
try correction needed to match that of the SPIRE 250 /im map (this 
correction is ~l"in both PACS bands). The fluxes are also aper- 
ture corrected, using a correction determined from observations of 
a bright point-like source. The la errors are found using apertures 
randomly placed in the maps; note that these errors scale with aper- 
ture size. The low confusion noise compared to SPIRE, plus the fast 
scan speed used in these observations, means that the integration 
time used in H-ATLAS is insufficient to provide confusion limited 
images with PACS. Full details of these observations can be found 
in |Ibar et al."lP010a> . 

The most appropriate flux density measurements, either point 
or extended (where this is larger), are given in the 'BEST' PACS 
columns in the SDP catalogue, along with the corresponding aper- 
ture radii, for sources with S/N ^ 5. As a result 151 and 304 
sources satisfy this condition at 100 and 160 /im respectively. The 
5a point source limits in the PACS catalogue are 132 mjy and 
121 mJy at 100 and 160 /im. It should be noted that the flux 
densities extracted from the PACS maps are only at 100 /im and 
160 /mi under the assumption of a constant energy spectrum, 
though the colour corrections for sources with a different SED are 
small (Poglitsch et al.|2010| >. 

The PACS time-line data have been high-pass filtered by sub- 
tracting a boxcar median over 3.4 arcmin (at 100 /im ) and 2.5' at 
160/rnilib ar et al. |2010a) . The filtering will lead to the underes- 
timation of flux for sources extended on scales comparable to the 
filter length. The exact flux loss for a particular source will de- 
pend on the size of the source along the scan directions, and will 
also depend on whether the peak surface brightness is above the 
4a threshold used in the second level filtering. A simple simula- 
tion of a circular exponential disc shows that the filtering removes 
~ 50% of the source flux if the diameter of the disk is equal to the 
filter length. If the diameter is half of the filter length, then only 
5% of the flux is removed. This suggests that sources with a diame- 
ter less than 1' should by relatively unaffected by the filtering. Flux 



3 see the scan mode release note, PICCMETN0.35 
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Figure 6. The integrated source counts from the combined set of 500 input 
(true) and extracted simulated catalogues, along with those calculated using 
the SDP catalogue for both versions of the simulations. 
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lated extracted and input (true) full 250 fim catalogues. The results for the 
two versions are very similar, so only the PSS points are shown here. 



measurements for sources larger than this should be treated with 
caution. 

Figure [4] compares the differential source counts calculated 
from the PACS SDP catalogues to those determined from the initial 
data of the complementary PACS Evolutionary Probe (PEP) survey 
( |Berta et al.|2010| l, which is deeper than H-ATLAS but covers a 
smaller area. The good agreement between the two sets of counts 
supports the initial assumption that all bright PACS sources should 
already be present in the SPIRE catalogue. However, there are in- 
sufficient sources in the SDP data to properly constrain the bright 
number counts tail. A full analysis of the PACS counts will be pre- 
sented in < |Ibar et al. |2010b| l. 

For the sources detected in the PACS 100 ^im map an addi- 
tional comparison can be made to this wavelength in the Impe- 
rial IRAS-FSC Redshift Catalogue of |Wang & Rowan-Robinson| 
(20091, which combines the original IRAS Faint Source Catalogue 
flux density values with improved optical and radio identifications 
and redshifts. There are 34 IRAS sources within the PACS region 
of the H-ATLAS SDP field; 19 of these have a reliable IRAS flux 
measurement and these are in good agreement with the SDP cata- 
logue, with a mean offset consistent with zero, as shown in Figure 

LU 



4 ASSESSING THE CATALOGUE RELIABILITY 
4.1 Simulation creation 

It is not enough to identify sources in the H-ATLAS SDP maps; the 
robustness of the catalogue must also be determined. This is done 
using realistic simulations of the observations, with the same noise 
properties as the processed maps, and a realistic cirrus background, 
based on IRAS measurements ( Schlegel et al. 1998 1. However, only 
the three SPIRE bands are considered in this initial analysis, as 
the PACS SDP catalogue is currently treated as an extension to the 
SPIRE data 

The simulated maps are randomly populated with sources gen- 
erated using the models of Negrello et al. ( 2007 1, which predict the 
number counts of both the spheroidal and protospheroidal galaxy 
populations separately; for the simulations, these predictions are 
combined together to give the expected total counts, and hence 
the corresponding set of source flux densities, for each band. Al- 
though Maddox et al. (2010) detected, in SDP data, strong cluster- 
ing for 350 fim and 500 /xm-selected samples, fluctuations due to 
faint sources at the SPIRE resolution are Poisson dominated, espe- 
cially at 250 fim (e.g. |Negrello et al.|2004||Viero et aL|2009] >. This 
suggests that, for the present purposes, using unclustered random 
positions is a sufficiently good approximation. The flux densities 
of all the sources in the models are reduced by 26% at 250 fim and 
15% at 350^imto improve the agreement with the observed (i.e. 
uncorrected) source counts in the SDP catalogue (Clements et al. 
|2010[ l; the results of this alteration are shown in Figure [6] The fi- 
nal flux density ranges are 0.11 mjy - 1.65 Jy at 250 fim, 0.24 
mjy - 0.83 Jy at 350 /xmand 0.45 mjy - 0.59 Jy at 500 /xmfor the 
simulated sources; this ensures that the simulated maps contain a 
realistic background of faint sources which can contribute to the 
confusion noise. 

The simulations are constructed by first adding the flux of each 
source in each band to the relevant position in a 1 arcsecond grid. 
Two versions of the simulations are created. In the first the sim- 
ulated sources are all one pixel in size (point-source-simulations: 
PSS), whereas in the second the sources are assigned a scale-length 
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Figure 8. The positional errors for the two different versions of the simulations, alongside a comparison of the two different source extraction position priors as 
previously discussed in Section [3~T] Also shown in[8b]are the positional errors plotted against the S/N in the input (true) catalogue, along with those determined 
by |Smith et aT]{20TT) for the SDP data at 5, 7.5 and 10cr. 



based on their catalogue redshift (extended-source-simulations: 
ESS). The scale-length is constant in physical units, and then con- 
verted to an angular scale using standard cosmology. The ESS will 
obviously be a better representation of the real data, but, as Sec- 
tion [3TTTT] shows, MADX underestimates the flux densities of ob- 
jects with sizes larger than the FWHM, so the PSS simulations pro- 
vide a useful comparison. It should be noted that the flux densities 
and positions of the input sources will be the same in both cases. 
The next step is to convolve the 1 arcsecond map by the appropri- 
ate Herschel PSF, also sampled on a 1 arcsecond grid, to give a 
map of flux per beam covering the full area of the SDP data. Then, 
the 1 arcsecond pixels are block averaged to give 5 arcsecond pix- 
els for the 250 fimmaps, and 10 arcsecond pixels for the 350 and 
500 fim maps. 

A background representing emission from Galactic cirrus is 
then added to the each map. The background value is estimated 
from the Schle geTet al.| ( |T998] > map of 100 /im dust emission and 
temperature by assuming a modified black-body spectrum with 
P = 2.0, and scaling to the appropriate wavelength. The resolu- 
tion of this IRAS map is lower than that in the SDP data, which 
means that small scale structure in the cirrus is not present in the 
simulations. Since the cirrus is highly structured, it is non-trivial to 
generate realistic structure on smaller scales, so as a simple approx- 
imation, the low resolution maps were used, though it should be 
noted that the true cirrus background will include more small scale 
features. It is clear that the real cirrus structure in the SDP data is 
highly non-Gaussian, so simply extrapolating the power spectrum 
to smaller scales does not significantly improve the model back- 
ground. 

Finally instrumental noise is added to each pixel as a Gaussian 
deviate, scaled using the real coverage maps so that the local rms is 
the same as in the real data. 

Sources are then extracted with MADX from both versions of 
the simulations, following the procedure described in Section [JIT] 
For the ESS maps, the flux densities in the three bands are again 
replaced with aperture-measured values for the extended sources. 
The 'optical sizes' (needed to determine a r using Equation [TJ in 
this case are taken as three times the scale-size taken from the in- 
put catalogues; this corresponds to a 6-band isophotal limit of ~25 



mag arcsec i Zhong et al. 2008 1. Finally, the MADX catalogue is 



cut to only include sources which are detected at the 5<r level in 
any of the available bands. This process is repeated 500 times, each 
time using a different realisation of the input model counts, to en- 



sure sufficient numbers of bright sources are present at the longer 
wavelengths. The average number of extracted sources which are 
also >5<r in any band is 5881 and 5772 for PSS and ESS respec- 
tively, which is lower than the 6876 sources present in the real SDP 
data; as Figure [6] illustrates, this is because the simulated source 
counts do not exactly reproduce the real SDP ones. Additionally, 
more sources are found for the PSS version because of the flux 
underestimation of extended sources which means that the faintest 
objects fall below the catalogue cut. In the remainder of this discus- 
sion, these MADX catalogues will be referred to as the 'extracted 
catalogues', and the simulated input source lists as the 'simulated 
input catalogues'. 

For each of the three bands in turn, starting with the bright- 
est, sources in the extracted catalogue are matched to the simu- 
lated input source that makes the largest contribution, determined 
by weighting with the filtered beam, at that extracted position. A 
match radius of 3 pixels (approximately equal to the FWHM in 
each band) is also imposed to ensure that a match is not made to 
an unfeasibly distant source. Since the typical positional error for a 
> 5(J 250 /im source is 2.5" or less, this match radius will ensure 
that almost no real matches are rejected, whilst the weighting will 
avoid spurious matches. Once matched, a simulated input source is 
removed from consideration to avoid double-matches. Considering 
each band separately will allow an extracted source to have three 
different simulated input counterparts, depending on where the ma- 
jority of its flux density comes from at 250, 350 and 500 /im . This 
ensures that the effects of source blending in the data can be prop- 
erly investigated, though it should be noted that the results are very 
similar if the counterparts are found at the highest resolution, short- 
est wavelength only. Full simulated input, extracted and matched 
catalogues for each band are then made by combining the results 
from the 500 individual sets of simulations together. 

The positional offsets and corresponding errors are shown in 
Figures|7]and[8] They demonstrate that there is no significant offset 
between the extracted and matched catalogues. The positional er- 
rors for 5er sources are ~2.4" at 250 /im in both versions, which 
agrees with the value of 2.40 ± 0.11" found for the real SDP 
data by |Smith et al.| pOTT) . The errors also approximately scale 
as 1/(S/N) in the 250 /im band, as predicted by e.g. |Ivison et al.| 
(2007). However, at low S/N there is an enhancement over the pre- 
dicted values, as illustrated for the PSS results in Figure[8b] This is 
a result of Eddington bias causing more faint sources errors to scat- 
ter up than vice-versa; if the positional errors are plotted against the 
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Figure 10. The ratio of flux densities for the matched sources in the noiseless MADX C'S'noiseless) ™d extracted catalogues as a function of extracted signal 
to noise for the three bands (including point sources only). Also shown are the median and 3a clipped mean values, calculated in bins of 0.05 in log(S/N). 
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Figure 11. The ratio of flux densities for the matched sources in the simulated input (true) and extracted catalogues as a function of extracted signal to noise 
for the three bands, using the gridded position simulations (including point sources only). Also shown are the median and 3<r clipped mean values, calculated 
in bins of 0.05 in \og(S/N). Note that confusion noise is not included in these simulations. 



The H-ATLAS SDP catalogue 9 



250 u.m; >5a only 




6-io 5 r~ 

5-1 5 ^ 
4-1 5 i 

3* i \ 

2-1 5 i 

■•\v: 



o.o 



350 |im; >5a only 



0.2 



0.4 



0.6 



-*2ncl brightes/^brights 



4-1 4 



3-1 0*t 



2-1 4 H 



1-10 4 H 



1.0 



500 |im; >5a only 




'2nd brightest' °brightest 



Figure 12. The PSF-weighted ratio of the brightest to second brightest input (true) source contributing to the extracted source, within the beam in each band, 
for > 5cr sources in the extracted catalogue. 



8-10' 
6-1 ! 
4-1 0' 1 
2-1 ; 



250 urn 




>5o sources only 
>10o sources only 

> 1.5 . 10.5% 

> 2.0 . 1 .6% 

> 2.5 = 0.2% 

> 3.0 = 0.0% 



,/s,„ 



1.5-1 s - 



10 



350 urn 



10000 r 



>5o sources only 
>10o sources only 



500 um 



>5o sources only 
10a sources only 



n beam' °tm mat 
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S/N in the simulated input catalogue, which does not suffer from 
this effect, then they are in better agreement with the prediction. 

Figure [8c] also illustrates the improvement in positional er- 
rors that arises from selecting sources at 250 only in MADX, 
instead of giving equal weight to all bands (flat-spectrum prior), 
as previously discussed in Section [3~T| Greater positional accuracy 
significantly enhances the efficacy of the cross-identification to op- 
tical sources using the Likelihood Ratio method ( Smit h et al.|201 1\ . 
This is why the better positions are deemed to outweigh the slight 
chance of missing red objects when using the 250 fim prior. 



4.2 Catalogue correction factors 

Inspection of Figure [6] shows a clear discrepancy between the ex- 
tracted and simulated input integral counts at faint 500 /im flux den- 
sities; this occurs due to a combination of two factors. The first, 
flux-boosting, is a preferential enhancement of faint source flux 
densities due to positive noise peaks, that arises due to the steep- 

^OTTjf 



ness of the faint end (i.e. Ssoo/jm < 40 mJy;|Clements et al. 
of the source counts. The second is a result of blending, where sev- 
eral simulated input sources (which may be too faint to be included 
individually) are detected as one source in the extracted catalogue. 

These effects can be quantified by direct comparison of the 
simulated input and extracted flux densities, shown in Figure [9] as 
a function of signal-to-noise in the extracted catalogue for both 
the ESS and PSS versions. Flux correction factors are derived from 
the 3cr clipped mean of these data; these are given in Table[JJ Ap- 
plying these factors to each extracted source gives a statistically 
'flux-corrected' catalogue. It should be noted however, that the dis- 
cussion of correction factors in this Section is restricted to sources 
detected at a 5<r or greater level only. 



An alternative approach to determining the catalogue correc- 
tion factors is to use a 'noiseless' catalogue, created by running 
MADX on the simulated maps before the addition of noise, as the 
comparison. As Figure [TO] shows, this does not accurately repre- 
sent the level of flux-enhancement in the data, because, the noise- 
less catalogue is also affected by source blending. Additionally, at 
low S/N the noiseless-input flux densities are generally brighter 
than the extracted ones, suggesting that MADX underestimates the 
background subtraction in the absence of noise. 

The relative contributions from the flux-boosting and source 
blending can be investigated with a new set of simulated, point- 
source only, maps, in which the sources are placed on a regu- 
lar spaced grid, with a 70" separation between points, to ensure 
no sources overlap. The source density is also lowered in these 
maps (imposed by excluding any source in the simulated input cat- 
alogue with a 250 /im flux density fainter than 6.6 mjy), so that 
sufficient unique positions can be generated. Inspecting the ratio 
of the extracted and simulated input fluxes - Figure [77]- suggests 
that the majority of the flux-enhancement seen in Figure [9] is due 
to blended sources, rather than boosting due to noise. However, 
the PSF-weighted ratio of the brightest to second brightest simu- 
lated input source contributing to each source in the extracted cat- 
alogue (Figure [72) appears to contradict this; it shows that, even at 
500 /im , blending with this second source would not increase the 
extracted flux density by the amount seen. The solution to this ap- 
parent contradiction becomes clear when the PSF-weighted ratio 
of the contribution from all the simulated input sources within a 
beam to the flux density of the simulated input match is considered 
instead (Figure [73). Here ~27% of 500 /im > 5a extracted sources 
have sufficient simulated input sources available to boost their flux 
densities by a factor of 2 or more when their contributions are com- 
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Figure 14. The fractional flux density error for the corrected extracted cat- 
alogues, ignoring any sources that fall outside the 99.73rd percentiles. The 
dotted lines indicate the expected behaviour. 



bined, even though their individual effect is small. Figure [L3] also 
shows that this confusion becomes negligible for > 10<j sources. 
This is in broad agreement with Chapin et al. (201 1 1 who find that 
the sub-mm peaks they detect using a survey with larger beams, 
but of similar depth to H- ATLAS, generally consist of a blend of 
several sources. Future versions of MADX will include a deblend- 
ing step which should reduce this effect. It should be noted that a 
mean sky-background of 6.8 mjy, 5.8 mjy or 4.1 mjy at 250 /im, 
350 /im and 500 /im respectively (determined from the mean of the 
simulated input catalogue), is subtracted before the histograms are 
calculated, to account for the background-subtraction carried out 
as part of the source extraction process. 

As a check on the success of the correction factors in TableQ] 



they are applied to the full extracted catalogues and the fractional 
flux density errors (after rejecting the points which lie outside the 
99.73rd percentile) are then calculated. As Figure [14] shows, these 
reduce with increasing S/N, but, as with the positional errors dis- 
cussed previously, Eddington bias prevents this behaving exactly as 
expected. Again, when plotted against the S/N from the simulated 
input catalogue (Figure[l4cJ the difference is reduced. 

As well as the flux correction factors, we also need to com- 
pleteness of the detected catalogues, especially at faint 350 and 
500 /im flux densities; this is clearly seen in Figures [l5a| and [l6a| 
which compare the differential source counts for the extracted, sim- 
ulated input and flux-corrected catalogues. The lower counts are 
due to the failure to detect some fraction of faint sources because of 
random noise fluctuations in the simulated maps or source blend- 
ing. This incompleteness can be quantified by simply taking the 
ratio of the flux-corrected to simulated input differential counts, to 
give a source-surface-density correction. Note that this is not ap- 
propriate for correcting the flux densities of individual sources, but 
rather it can be applied when making statistical analyses of the cat- 
alogue as a whole. This correction is shown in Figures [T5b| and |T6b"| 
and also given as an additional correction factor in Table [2] Figures 
1 15c| and fl 6c [ demonstrate the success of the density correction when 
applied to the integral source counts. 

There is one further factor that can affect the extracted cata- 
logue - contamination from spurious sources. The expected num- 
ber of ^ 5a random noise peaks present in the 250 ^im map area 
is only ~0.05, so this should be negligible in the SDP catalogue. 
Contamination from fainter sources which are boosted or blended 
is accounted for in the flux correction factors. 

It should be noted that an alternative approach to correct- 
ing the SDP H-ATLAS catalogue was adopted in Clements et al. 
( |2010fr . In this case corrections were determined from the ratio of 
extracted to simulated input integral source counts. This combines 
the effects of incompleteness and flux boosting, and is appropri- 
ate for recovering the correct source counts, but not for correcting 
individual catalogue sources. 



5 CONCLUDING REMARKS 

This paper has presented the SDP catalogue for the first observa- 
tions of the H-ATLAS survey, along with a description of the sim- 
ulations created to determine the factors needed to correct it for 
the combined effects of incompleteness, flux-boosting and source 
blending. The main results of this analysis are summarised below: 

(i) The extracted flux densities of 350 /im and 500 /im sources 
can be enhanced over their simulated input values, by factors of up 
to ~2. This predominantly affects sources with 5 < S/N < 15; 

(ii) These enhancements are shown to be due to source blending, 
with ~27% of > 5cr 500 /im sources having sufficient simulated 
input sources available within a beam to create a boosting of ~2; 

(iii) A combination of flux density and source-surface-density 
corrections are necessary to correct the extracted source counts for 
these factors. 

It is anticipated that future development of the MADX soft- 
ware will incorporate subroutines to deal with both the effects of 
map pixelization and source blending in the processing stage. 

MADX is not the only source extraction method being con- 
sidered for the H-ATLAS data, but time constraints mean that it 
has been used for the SDP catalogue presented here. A comparison 
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between different source extraction algorithms is currently ongo- 
ing; these include SUSSEXtractor developed by Savage & Oliver 
l|2007}, as well as the 'matrix filter' method of Herranz et al. ( 2009 1 
and the 'Mexican Hat wavelet' method of Gonzalez-Nuev o~et al. 
(2006 ) and Lopez-Caniego et al. (2006|. The results of this com- 
parison will be used to improve future H-ATLAS catalogues. 

This initial, uncorrected, catalogue will be available from 
http : / / www . h-atlas . org, though it is expected that as the 
data processing steps are refined it will undergo future updates. 
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Figure 15. Extended source simulations 
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Table 1. The flux density correction factors (FC) at each SPIRE wavelength, as a function of S/N in the extracted catalogue, determined from the ratio of flux 
densities in the matched extracted and simulated input catalogues. To apply the correction at some catalogue flux density, / ca t: /corr = fcnt/FC, though 
note that the density correction given in Table|2]should also be applied as well. 
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Table 2. The surface density correction (SC) at each SPIRE wavelength as a function of corrected flux density, determined from the ratio of the flux-corrected 
to simulated input differential counts. To apply the correction at some corrected flux density, / CO rr: / C orr_final = fcon/SC. The corrected flux densities 
given are the central bin values. 



